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Abstract 



Using data from 92 indices of stock exchanges worldwide, I analize the cluster formation and evolution 
from 2007 to 2010, which includes the Subprime Mortgage Crisis of 2008, using asset graphs based on distance 

■ thresholds. I also study the survivability of connections and of clusters through time and the influence of 
noise in centrality measures applied to the networks of financial indices. 

R 

^ 1 Introduction 

.5- 

■ The study of the dynamics of clusters has been occupying the minds of many researchers in the past years. 
q-i In particular, one would like to study how clusters form and evolve in time. In the present work, I build and 

analize networks based on 92 indices of stock exchanges around the globe. The indices cover the five habitable 
continents, spanning a variety of markets and economies. In a companion article [I], I analyzed data from 
some of the major international financial crises of the past decades using asset graphs in order to visualize the 
| formation and evolution of clusters. There, I also analyzed how the second highest eigenvalue for the correlation 
■^j- ■ matrix of indices may show an internal structure peculiar to financial market indices which is a consequence of 
them operating at different times. Here, I again use asset graphs in order to study the formation and evolution 
1 \ of financial markets indices, but now covering the years from 2007 to 2010. These years include the recent 

■ Suprime Mortgage Crisis, that reached its peak in 2008, and the succeeding Credit Crisis that lasts up to these 
t-h days. 

Asset graphs have been used with a variety of financial data [2]-[9], and they often provide a graphic 
visualization which is complementary to the ones of Minimum Spanning Trees and Planar Maximally Filtered 
Graphs (see [11] for examples of both types of graphs). An asset graph is a subset of the whole network based 
C3 [ on a distance measure obtained, as an example, from a correlation matrix built from the time series of some 
variables, where every node is connected to all others. In order to restrict the number of nodes and of edges 
(connections) between them, a threshold is established, which may be the total number of allowed edges in the 
graph or a value bellow or above which no connection is to be considered. 

In Section 2, correlation matrices based on the time series of benchmark indices of a variety of stock 
exchanges are built using Spearman rank correlation. The reason why I am using Spearman correlation and 
not the usual Pearson correlation is because the former is more useful when measuring linear relations between 
variables, and the latter is better at measuring nonlinear relations between them (there is a comparison between 
both coefficients in Subsection 2.9). Each correlation matrix so obtained is then used in order to build a 
measure of distance between nodes (indices), and then threshold values are chosen so that all distances bellow 
the threshold are used in order to create an asset graph, and the ones above it are discarded. By using a variety 
of thresholds, up to a limit where random noise becomes very intense, one is then able to witness the formation 
of clusters, first isolated and then unified as the threshold limit goes up. The upper limit for the threshold is 
obtained by using randomized data based on the original data so as to keep the original frequency distributions 
but destroy any temporal relation between them. In order for one to contemplate the true distances between 
indices, three dimensional maps are built using Principal Component Analysis |12j . 
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Section 3 considers the topic of survivability of clusters through time, which is essential, for instance, if 
one wishes to build risk-minimizing portfolios that are useful for future investments when based in past data. 
Single-step survival ratio [3] and a survivability measure developed in [13] are used in order to make this 
analysis, and the possibility of building asset graphs based on survivability is contemplated. 

Section 4 shows a study of some centrality measures of the networks obtained from the asset graphs and 
on how they change with the value of the threshold used in order to build each graph and also how the node 
degree evolve in time. Section 5 consists on a general coclusion. 

2 Asset Graphs 

Here I build asset graphs by assigning different threshold values for a distance measure obtained from the 
correlation between financial market indices. The data consists on the indices of 92 stock market worldwide 
(for a list of those indices, and their symbols in this article, see Appendix A), collected from the year 2007 to 
the year 2010. The index values are computed daily, and here represented by Pt for day t. Then, I calculate 
their log-returns, defined by 

Rt=]n(P t )-HPt-i) , (1) 

where Pt-i is the return on day t — 1. 

The log-returns are then used to calculate the correlations between pairs of indices. From the many correla- 
tion measures, I chose Spearman's rank correlation, for it is better at identifying nonlinear correlations between 
variables. The most common choice would be Pearson's correlation, and I make a comparison between results 
obtained based on both types of correlations later in this same section. Given a correlation coeficient Cjj in a 
correlation matrix of all indices, a distance measure is defined as 

dij = 1 - . (2) 

As correlations between indices vary from —1 (anticorrelated) to 1 (completely correlated), the distance between 
them vary from (totally correlated) to 2 (completely anticorrelated). Totally uncorrelated indices would have 
distance 1 between them. 

Based on the distance measures, three dimensional coordinates are assigned to each index using a procedure 
called Principal Component Analysis |12j . that minimizes the differences between the true distances and the 
distances represented in the graph. 

As some correlations may be the result of random noise, I ran some simulations based on randomized 
data, consisting on randomly reordering the time series of each index so as to destroy any true correlations 
between them but maintain each frequency distribution intact. The result of such simulations (1000 for each 
period being studied) is a distance value above which correlations are probably due to random noise (this 
value is usually close to d = 0.6). Then, asset graphs are built using distance measures as thresholds. As an 
example, for threshold T = 0.5, one builds an asset graph where all distances bellow 0.5 are represented as 
edges (connections) between nodes (indices). All distances above this threshold are removed, and all indices 
that don't connect to any other index bellow this threshold is also removed. 

In what follows, I show the resulting asset graphs concerning each semester of the years ranging from 2007 
to 2010, for thresholds varying from T = 0.2 to T = 0.5. These values were chosen so as to help visualization, 
which becomes harder for higher thresholds and many more connections. The separation into semesters is 
based on the constatation that correlations between markets may vary somewhat in brief periods of time, and 
that financial crises tend to concentrate on half of a year (usually the second semester). 

2.1 First Semester, 2007 

Figure 1 shows the three dimensional view of the asset graphs for the first semester of 2007, from T = 0.2 to 
T = 0.5. 

At T = 0.1, there are already strong ties between S&P and Nasdaq, and between Fance and Germany, 
forming the seeds of the North American and European clusters. For T = 0.2, the European cluster grows 
strong, with the addition of the UK, Switzerland, Italy, Belgium, the Netherlands, and Spain. We also have, 



2 



for the first time, an African cluster, comprised of Namibia and South Africa. At T = 0.3, we see the growth 
of the now American cluster, with the addition of Canada, Mexico, and Brazil. The European cluster receives 
the addition of Ireland, Austria, Sweden, Denmark, and Finland. A Pacific Asian cluster is formed between 
Hong Kong, Singapore, and Indonesia. 

For T = 0.4, Argentina joins the American cluster, which is now denser. Denmark, Norway, Portugal, 
Greece, the Czech Republic, Poland, and Turkey join the European cluster. South Africa connects with the 
European cluster, and Australia makes connections to both the European and the Pacific Asian clusters. The 
latter is now comprised of Japan, Hong Kong, South Korea, Malaysia, Singapore, and Indonesia. At T = 0.5, 
Chile joins the American cluster, and Luxembourg, Hungary, Estonia, Lithuania, Russia, and Israel join the 
European cluster. Singapore and Namibia also establish connections with European indices. India and Taiwan 
join the now Asian cluster, which also becomes denser, and Cyprus connects with Greece. There is also a 
greater integration of the Asian and African clusters with the European one. At T = 0.6, the American cluster 
connects with the European one, and Peru establishes connections with Europe. Philipines connects with 
both Europe and Asia, and Egypt connects with Europe. For T = 0.7, noise starts to dominate. Colombia, 
Romania, Ukraine, and Thailand establish connections with existing clusters. More connections, many of them 
at random, are made for higher thresholds, and the last connections happen at T = 1.4. 
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Fig. 1. Three dimensional view of the asset graphs for the first semester of 2007, with threshold ranging from 
T = 0.2 to T = 0.5. 



2.2 Second Semester, 2007 

Figure 2 shows the three dimensional view of the asset graphs for the second semester of 2007, from T = 0.2 
to T = 0.5. 

At T = 0.1, we already have a North America cluster, comprised of S&P and Nasdaq, a well formed 
European cluster, whose members are the indices from the UK, France, Germany, Italy, Belgium, and the 
Netherlands, and an African cluster, formed by South Africa and Namibia. For T = 0.2, only the European 
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cluster grows strong, with the addition of Switzerland, Austria, Sweden, Finland, Norway, and Spain, and the 
formation of more ties between the previous members of the cluster. For T = 0.3, the now American cluster 
is joined by Canada, Mexico, Brazil, and Argentina. The European cluster has now the addition of Ireland, 
Luxembourg, Denmark, Greece, the Czech Republic, Poland, and Turkey. It also establishes strong connections 
with South Africa. Peru, Russia, and Cyprus also join the European cluster, although weakly. Peru connects 
with Denmark, Russia connects with Greece and Turkey, and Cyprus connects with Greece and Turkey. A 
new cluster, solely comprised of Pacific Asian indices plus the Australian index, is formed, comprised of Japan, 
Hong Kong, Taiwan, South Korea, Malaysia, Singapore, and Australia. 

For T = 0.4, the first connections appear between the American and the European clusters. The American 
cluster adds Chile, and connects with Peru. The European cluster is now even denser, and receives Portugal and 
Hungary. Connections are formed between the European cluster and Israel, and more connections are formed 
with the African cluster. The Pacific Asian cluster grows with the joining of Thailand and Indonesia, and it 
also grows denser. A single connection with the European cluster is made by Singapore and Norway, and a 
connection appears between Singapore and South Africa. At T = 0.5, a great unification of the world markets 
occur, and all the networks join at several nodes. Nevertheless, the only indices that join the network are 
Philipines, connected with the Pacific Asian cluster, and New Zealand, connected with Australia. At T = 0.6, 
most of the European connections are already formed, and the cluster receives the addition of Romania. India 
connects with all clusters at once, and new connections are formed among the existing clusters. At T = 0.7, a 
threshold at which noise starts to become important, Colombia, Iceland, Vietnam, and Tunisia join the global 
network. We also witness the formation of an Arab cluster, comprised of Jordan and the United Arab Emirates, 
connected with the Pacific Asian cluster by Malaysia, where the Islam is the official religion. The last links are 
formed at T = 1.4. 
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Fig. 2. Three dimensional view of the asset graphs for the second semester of 2007, with threshold ranging 
from T = 0.2 to T = 0.5. 



2.3 First Semester, 2008 

Figure 3 shows the three dimensional view of the asset graphs for the first semester of 2008, from T = 0.2 to 
T = 0.5. 
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Fig. 3. Three dimensional view of the asset graphs for the first semester of 2008, with threshold ranging from 
T = 0.2 to T = 0.5. 

At T = 0.1, the North American cluster, for now formed by S&P and Nasdaq, and the European cluster, 
comprised of France, Germany, the Netherlands, and Spain, are already present. For T = 0.2, the European 
cluster grows with the addition of the UK, Switzerland, Austria, Italy, Belgium, Sweden, and Finland. We also 
have the connection of Greece and Cyprus, and of Namibia with South Africa. At T = 0.3, the European cluster 
grows denser, and adds to itself Ireland, Luxembourg, Denmark, Norway, Portugal, and the Czech Republic. 
We also witness the formation of the Pacific Asian cluster, comprised of Japan, Hong Kong, South Korea, and 
Singapore. 

For T = 0.4, Canada, Mexico, Brazil, and Argentina join with S&P and Nasdaq to form the American 
cluster. The European cluster grows denser and joins with Greece, Hungary, Poland, Russia, Turkey, and 
Cyprus. We also have Taiwan and Australia joining the Pacific Asian cluster. At T = 0.5, the American and 
European clusters are linked via S&P and France, and via Brazil, with many European indices. Chile and Peru 
join the European and not the American cluster, and the European cluster establishes its first connections 
with Israel, South Africa, and Singapore. Croatia and Romania also join the European cluster, and the Pacific 
Asian cluster strenghtens its connections, also receiving the addition of India, now. 

At T = 0.6, true globalization occurs, as the American and the European clusters establish many con- 
nections, the European cluster strengthens connections with the Pacific Asian one, and also with the African 
cluster (Namibia and South Africa). New countries that join the clusters are Colombia, Slovenia, and Ukraine 
(with Europe), and China, Thailand, Malaysia, Philipines, and New Zealand (with Pacific Asia). We also 
witness the formation of an Arab cluster, first comprised of the United Arab Emirates and Ohman. 

At T = 0.7, a threshold for which noise starts to dominate, we see many more connections between the 
already participating indices, and the inclusion of Bulgaria, Estonia, Latvia, and Lithuania among the European 
cluster, of Indonesia in the Pacific Asian cluster, and of Bahrein, Qatar, and Pakistan in the Arab cluster. 
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Saudi Arabia, Egypt, and Kenya, also connect with other indices, but apparently at random. Slovakia, Serbia, 
Kazakhstan, Palestine, Lebanom, Jordan, Kuwait, Bangladesh, Mongolia, Vietnam, Tanzania, and Mauritius 
join at T = 0.8. The last connections are made at T = 1.4. 



2.4 Second Semester, 2008 

Figure 4 shows the three dimensional view of the asset graphs for the second semester of 2008, from T = 0.2 
to T = 0.5. 
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Fig. 4. Three dimensional view of the asset graphs for the second semester of 2008, with threshold ranging 
from T = 0.2 to T = 0.5. 

We are now dealing with a period of crisis, so connections are stronger now. At T = 0.1, we already have 
three clusters: the American cluster, formed by S&P and Nasdaq, the European cluster, comprised of the UK, 
France, Germany, Switzerland, Italy, the Netherlands, Finland, and Spain, and the African cluster, formed by 
Namibia and South Africa. At T = 0.2, the European cluster grows denser and adds to itself Austria, Belgium, 
Sweden, Denmark, and Portugal. Greece and Cyprus make a separate small cluster, which will soon join the 
European cluster at T = 0.3. At this threshold, Mexico and Brazil join the American cluster (Canada does not, 
for now). Ireland, Luxembourg, Norway, Greece, the Czech Republic, and Poland join the European cluster, 
and South Africa establishes connections with this same cluster. A Pacific Asian cluster begins to form between 
Hong Kong, Singapore, and Australia. 

For T = 0.4, Canada joins the American cluster, and Mexico, of the same cluster, establishes connections 
with Europe. Argentina links with Brazil, and Chile, Colombia, and Peru connect with Europe. Croatia, 
Slovenia, Romania, Estonia, Lithuania, Ukraine, and Turkey also join the European cluster. India, Japan, 
Singapore, and Australia also establish connections with Europe. An Arab cluster is formed between Bahrein, 
Qatar, and Ohman. The Pacific Asian cluster grows with the addition of Japan, Taiwan, South Korea, and 
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New Zealand. The African cluster links to both Europe and Pacific Asia. At T = 0.5, many connections are 
made between the American and the European clusters, and more connections exhist between Europe, Pacific 
Asia, and Africa. New indices that join Europe are the ones from Hungary, Bulgaria, Latvia, and Russia, and 
Thailand, Malaysia, Indonesia, and Philipines join the Pacific Asian cluster. The Arab cluster is still isolated, 
but now has the addition of Jordan, Kuwait, and of the United Arab Emirates. 

At T = 0.6, more connections are made between the already participating indices, Israel joins Europe, which 
is also joined by Serbia and Bosnia and Herzegovina. Montenegro joins with the Pacific Asian cluster, which 
may already be an effect of random noise. The Arab network now makes many connections with both Europe 
and Pacific Asia, and Mauritius joins with Australia. A connection appears between Ghana and Botswana, 
establishing a second African cluster, disconneted from all others. From T = 0.7 onwards, all clusters are 
connected, and random noise starts to dominate. The last connections are made at T = 1.3. 

2.5 First Semester, 2009 

Figure 5 shows the three dimensional view of the asset graphs for the first semester of 2009, from T = 0.2 to 
T = 0.5. 
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Fig. 5. Three dimensional view of the asset graphs for the first semester of 2009, with threshold ranging from 
T = 0.2 to T = 0.5. 



At T = 0.1, we have three clusters, the first comprised of the pair S&P - Nasdaq, the second an European 
cluster comprised of France, Germany, Italy, the Netherlands, and Spain, and the third formed by the pair 
Namibia - South Africa. At T = 0.2, the North American cluster is joined by Canada, and the European cluster 
is joined by the UK, Switzerland, Sweden, and Finland, and the pair Namibia - South Africa remains isolated. 
For T = 0.3, Mexico, Brazil and Argentina join S&P, Nasdaq and Canada to form the American cluster, the 
European cluster becomes denser and absorbs Ireland, Denmark, Norway, and Portugal. The South African 
cluster joins with the European one at many nodes, and Greece forms a cluster with Cyprus, still disconnected 
from Europe. In Pacific Asia, Hong Kong connects with Singapore. 

For T = 0.4, the American cluster, now joined by Chile, connects with the European one, which becomes 
even denser and is joined by Peru (a South American country), Austria, Belgium, Luxembourg, Greece, the 
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Czech Republic, Coratia, Poland, Romania, Russia, Turkey, and Israel (a country from the Middle East with 
strong ties to Europe). The Pacific Asian cluster gorws with the addition of India, Japan, Taiwan, South 
Korea, Thailand, Indonesia, and Australia. At T = 0.5, we have more connections between the American and 
the European clusters. The American cluster is joined by Colombia, and makes connections with the African 
cluster. The European cluster is joined by Hungary and makes connections with the Pacific Asian cluster, 
which is joined by China, Malaysia, and Indonesia. New Zealand connects with Australia, and the African 
cluster connects with the Pacific Asian one. We also have a connections between Egypt and Ukraine, probably 
the effect of noise. 

At T = 0.6, many more connections are formed. The only indices to join the now Global cluster comprised 
of the American, European, Pacific Asian and African clusters are those from Iceland (with Europe and Africa), 
Bosnia and Herzegovina, which joins Europe and Asia via Ukraine, which also joins the Global cluster, Bulgaria, 
and Kazakhstan. Jordan, Saudi Arabia, Qatar, the United Arab Emirates, and Ohman form a loosely knit 
Arab cluster which is also connected to the other clusters. Egypt now makes connections with the Global 
cluster at several nodes. For T = 0, 7, noise starts to take over, and we have many more connections, many 
of them apparently of random nature, and the joining of Costa Rica, Serbia, Slovenia, Macedonia, Estonia, 
Kuwait, Bahrein, Vietnam, Philippines, Nigeria, Kenya, and Mauritius. Bermuda, Jamaica, Malta, Slovakia, 
Palestine, and Lebanon join the Global cluster at T = 0.8. Panama, Venezuela, Solvakia, Montenegro, Sri 
Lanka, Bangladesh, Morocco, Tunisia, Ghana, Tanzania, and Botswana only join at T = 0.9. The network is 
fully integrated at T = 1.3. 



2.6 Second Semester, 2009 

Figure 6 shows the three dimensional view of the asset graphs for the second semester of 2009, from T = 0.2 
to T = 0.5. 
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Fig. 6. Three dimensional view of the asset graphs for the second semester of 2009, with threshold ranging 
from T = 0.2 to T = 0.5. 

At T = 0.1, we have two clusters: the first comprised of S&P and Nasdaq, and the second, the European 
one, whose components are the UK, France, Germany, Italy, the Netherlands, and Spain. At T = 0.2, the 
American cluster remains unchanged and the European cluster is joined by Switzerland, Belgium, Sweden, 
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Finland, Norway, and Portugal. Greece forms a pair with Cyprus and Namibia forms a pair with South Africa. 
For T = 0.3, the American cluster is joined by Canada and connects with Europe through France and Germany, 
Brazil and Argentina form a pair, and the European cluster is joined by Austria, Denmark, Hungary, Poland, 
and Russia, which connects to many nodes in the European cluster. Also, a pair is formed between Hong Kong 
and Singapore. 

At T = 0, 4, Brazil, Argentina, and Peru join the American cluster, which makes even more connections 
with the European one. The latter grows denser and is joined by Luxembourg, the Czech Republic, and Poland 
and connects with the African cluster (Namibia and South Africa) at several nodes. Estonia and Lithuania 
form an isolated pair, Qatar and the United Arab Emirates form the seed of an Arab cluster, Japan and Hong 
Kong connect with Australia, and Hong Kong connects with Thailand, forming an Asian Pacific cluster. There 
is also a strange connection between Hong Kong and Kazakhstan. 

For T = 0.5, Chile joins the American cluster, which makes more connections to Europe, including Russia, 
and which also connects with the African cluster. The European cluster grows even denser and is joined by 
Greece and Cyprus, Romania, Bulgaria, and Turkey. Joined via Bulgaria to the European cluster are the 
indices from Lithuania (and so Estonia) and Kazakhstan. The Pacific Asian cluster is now joined by India, 
Taiwan, South Korea, Malaysia, Singapore, and Indonesia, growing to a large cluster. Singapore, Indonesia, 
and India make some marginal connections with the European cluster, thus weakly connecting the Pacific Asian 
cluster with Europe, and Hong Kong connects with South Africa. The Arab cluster (Qatar plus the United 
Arab Emirates) is joined by Egypt and Ohman. 

At T = 0.6, Colombia becomes part of the American cluster, and Venezuela connects with Spain and 
Portugal. Serbia, Ukraine, Israel, and Saudi Arabia connect with the Global network at many points. Jordan 
joins the Arab cluster, China connects with Hong Kong, and New Zealand connects with Australia and Bulgaria. 
At T = 0.7, a threshold in which noise becomes more important, Iceland, Slovakia, Croatia, Slovenia, Bosnia 
and Herzegovina, Montenegro, Macedonia, and Latvia, join the European network and, at some nodes, the 
Global network in general. Palestine, Lebanon, Kuwait, Bahrein, and Pakistan all join the Arab cluster, also 
establishing some connections with the Global network. Panama, Morocco, Tunisia, Ghana, Nigeria, Tanzania, 
and Mauritius establish some connections, most probably the result of noise. For larger thresholds, it becomes 
nearly impossible to distinguish true connections from random ones, and the last connections occur at T = 1.3. 

2.7 First Semester, 2010 

Figure 7 shows the three dimensional view of the asset graphs for the first semester of 2010, from T = 0.2 to 
T = 0.5. 

At T = 0.1, there already are three clusters: the American cluster, formed by the pair S&P - Nasdaq, the 
European cluster, formed by the UK, France, Germany, Italy, Belgium, and the Netherlands, and the African 
cluster, formed by the pair Namibia - South Africa. At T = 0.2, the European cluster grows denser and is 
joined by Switzerland, Austria, Sweden, Denmark, Finland, Norway, Spain, and Portugal. Greece forms a pair 
with Cyprus and the remaining clusters stay the same. For T = 0.3, Canada and Argentina join the American 
cluster. Ireland, the Czech Republic, Hungary, Poland, Russia, and Turkey join the European cluster, which 
also connects with the African one. A Pacific Asian cluster is formed, comprised by India, Japan, Hong Kong, 
Taiwan, South Korea, Singapore, and Australia. At T = 0.4, the American cluster is joined by Mexico and 
Brazil, and establishes a first link with Europe through Sweden. Peru links with the American cluster, but also 
to the European one through Switzerland. Luxembourg and Greece join the European cluster, which connects 
to the Pacific Asian one, and a pair is formed between Estonia and Lithuania. The Pacific Asian cluster is 
joined by Indonesia and connects with Kazakhstan. 

For T = 0.5, Chile joins the American network, which makes many more connections with Europe and a 
single connection with Africa. The European cluster grows even denser and is joined by Croatia, Romania, by 
the pair Estonia - Lithuania, Ukraine, and Israel. Thailand, Malaysia, Philippines, and New Zealand join the 
Pacific Asian network, which now connects with Africa and makes many more connections with Europe. At 
T = 0.6, many more connections are made within and between clusters. Colombia joins the American cluster. 
Saudi Arabia, Qatar, the United Arab Emirates, and Ohman connect with Europe, and the last three ones 
compose an Arab cluster which is also connected with the Pacific Asian one. China joins the Pacific Asian 
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cluster, and Egypt connects with all clusters. 

At the beginning of the noisy region, at T = 0.7, Iceland and Bulgaria join the Global network at several 
nodes, Serbia and Slovenia join Europe, and Macedonia connects with Egypt. Jordan, Kuwait, and Bahrein 
join the Arab network, and Vietnam joins the Pacific Asian cluster. For higher thresholds, many connections 
are established apparently at random. Nigeria, Kenya, Tanzania, and Mauritius join the Global cluster at 
T = 0.8, and Venezuela, Palestine, Lebanon, Pakistan, Sri Lanka, Bangladesh, Mongolia, Morocco, Tunisia, 
Ghana, and Botswana join the Global cluster at T = 0.9. No new connections are established beyond T = 1.3. 





2.8 Second Semester, 2010 

Figure 8 shows the three dimensional view of the asset graphs for the second semester of 2010, from T = 0.2 
to T = 0.5. 

The second semester of 2010 was a time of lower volatility and also of lower correlation, so connections 
between indices are also weaker. At T = 0.1, the only connections are those between S&P and Nasdaq, and 
between France, Germany, Belgium, and the Netherlands, forming the seeds of an American and an European 
clusters. For T = 0.2, the American cluster remains the same and the European cluster grows denser and is 
joined by the UK, Switzerland, Austria, Italy, Finland and Spain. The pair Namibia - South Africa, the African 
cluster, is formed at this threshold. At T = 0.3, the European cluster is joined by Ireland, Sweden, Norway, 
Portugal, and Greece, and makes one single connection with the African cluster. 

For T = 0.4, the American cluster finally grows to include Canada, Mexico, Brazil, and Argentina, and 
also connects with Europe. The European cluster keeps growing and now includes the Czech Republic, Poland, 
Russia, and Cyprus (through Greece). The Pacific Asian cluster begins to form with Hong Kong, Taiwan, and 
Australia. T = 0.5 brings Colombia and Peru into the American cluster, Luxembourg, Hungary, Turkey, and 
Israel into the European cluster. The Pacific Asian cluster grows with the addition of Japan, China (connected 
with Hong Kong), South Korea, Singapore, and Indonesia. 
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For T = 0.6, Chile joins the American cluster, and Lithuania, Ukraine, Saudi Arabia, and India connect with 
the European cluster. Some connections form between the Pacifi Asian and European clusters, and Thailand 
and Singapore join the Pacific Asian cluster. Noise is already influent at T = 0.7, where the American network 
finally establishes connections with the African cluster. Romania and Estonia join the European cluster, Qatar 
and Ohman connect with the Pacific Asian cluster and not between themselves. Philippines also join the Pacific 
Asian cluster, and Egypt establishes connections with both the European and the Pacific Asian clusters. Many 
more connections are made from T = 0.8 and beyond, but it is practically impossible to separate meaningful 
connections from random ones, anymore. The last connections occur at T = 1.3. 





2.9 Whole data and Pearson correlation 

I now shall analyze two types of criticism the previous results might face. The first one is the use of possibly 
too few data in order to generate the networks, which might introduce much statistical noise to the results 
obtained. The second is the use of the Spearman rank correlation instead of the Pearson correlation. 

In order to investigate the effects of the amount of data used and the choice of correlation, I now make two 
asset graphs, both based on the whole data (1015 days) from 2007 to 2010, the first one based on the Spearman 
rank correlation and the second based on the Pearson correlation. Since the two measures don't have necessarily 
the same values, I chose T = 0.5 for the threshold of the Spearman correlation and T = 0.452 for the threshold 
of the Pearson correlation. The first threshold is a compromise between visibility and proximity to the noise 
level, and the second value minimizes the differences between both asset graphs in terms of the number of 
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connections in each graph. In order to facilitate the comparison, the coordinates based on the Spearman 
correlation are being used for both graphs. 

Note that there are nearly no diffferences between both graphs. All clusters are preserved at all levels for 
other thresholds as well. So, one may conclude that most connections between the indices are basically linear 
and that the choice between using Pearson or Spearman correlation doesn't modify the major results. 
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Fig. 9. Three dimensional view of the asset graphs for the whole period of data (2007 to 2010). The figure on 
the left is based on the Spearman rank correlation with threshold T = 0.5, and the figure on the right is based 
on the Pearson correlation with threshold T = 0.452. 



Also, if one compares Figure 9 (left) with the asset graphs obtained for the separate semesters, one can see 
the connections are quite stable in time, which is a discussion that shall be pursued further in Section 3. 



2.10 Lagging indices 

A third issue that could be raised is the use of same-day data for calculating correlations, when in truth markets 
don't operate at the same time. As an example, there is no intersection between the opening hours of the Tokyo 
Stock Exchange and the New York Stock Exchange. The issue of to lag or to not lag data is an important 
one, which shall be resolved in some future work. What can be done for now is to lag some indices so that 
we will compare them to the non-lagged indices of the previous day. In order to decide which indices to lag, 
we shall take a look at the eigenvector of the second highest eigenvalue of the correlation matrix between the 
indices. As stated in [1] and in many other works, the eigenvector of the first highest eigenvalue is related with 
a market mode which explains the common movement of the indices. Such eigenvalue is set far apart from 
all others, as shown in Figure 10, being much higher than it would be expected from those of a correlation 
matrix based on noise, and its corresponding eigenvector shows equal contributions from nearly every index 
being studied. In Figure 10, the noise region is drawn as a shady area. Here, this area was calculated by 
randomizing each time series of each index so as to eliminate any correlation between them, but keeping their 
frequency distributions intact. The result was obtained from 1000 simulations using randomized data, but is 
very similar to the theoretical result obtained using Random Matrix Theory |14| . 




Fig. 10. Eigenvalues of the correlation for data from 2007 to 2010, in order of magnitude. The shaded area 
corresponds to the eigenvalues predicted for randomized data. 



12 



The second highest eigenvalue is set not as far apart from the others in the region considered as noise as 
the highest one. Its corresponding eigenvector, e^, (Figure 11) reveals a part of the internal structure of the 
correlations that is peculiar to indices of stock exchanges worldwide, which operate at different intervals of 
time pQ. In the figure, white bars correspond to positive signs in the eigenvector, and gray bars correspond to 
negative signs in the eigenvector. 
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Fig. 11. Contributions of the stock market indices to eigenvector e2, corresponding to the second largest 
eigenvalue of the correlation matrix. White bars indicate positive values, and gray bars indicate negative 
values, corresponding to the first and second semesters of 2001. The indices are aligned in the following way: 
S&P, Nasd, Cana, Mexi, Pana, CoRi, Berm, Jama, Braz, Arge, Chil, Colo, Vene, Peru, UK, Irel, Fran, Germ, 
Swit, Autr, Ital, Malt, Belg, Neth, Luxe, Swed, Denm, Finl, Norw, Icel, Spai, Port, Gree, CzRe, Slok, Hung, 
Serb, Croa, Slov, BoHe, Mont, Mace, Pola, Roma, Bulg, Esto, Latv, Lith, Ukra, Russ, Kaza, Turk, Cypr, 
Isra, Pale, Leba, Jord, SaAr, Kuwa, Bahr, Qata, UAE, Ohma, Paki, Indi, SrLa, Bang, Japa, HoKo, Chin, 
Mong, Taiw, SoKo, Thai, Viet, Mala, Sing, Indo, Phil, Aust, NeZe, Moro, Tuni, Egyp, Ghan, Nige, Keny, 
Tanz, Nami, Bots, SoAf, and Maur. 

All indices of countries that are from Greece to the East, in terms of latitude, with the exceptions of 
Russia, Turkey, Israel, and South Africa, appear with negative values in eigenvector e%. All these exceptions 
are indices from stock exchanges that have a non-negative intersection with the opening hours of the New York 
Stock Exchange. Also, from Greece to the West, the indices from Panama, Costa Rica, Bermuda, Jamaica, 
Venezuela, Malta, and Iceland, all of them indices with very low correlation with all the others, also appear 
with negative values in the same eigenvector. 

If one laggs all the indices that have negative values in eigenvector e2 , which means that one compares their 
returns at day t + 1 with day t of all the other indices, one may then create a new returns matrix and thus a 
new correlation matrix. The eigenvalues of such a matrix are plotted in Figure 12, together with the region 
associated with noise, the result of yet more 1000 simulations with randomized data. 
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Fig. 12. Eigenvalues of the correlation for data from 2007 to 2010, in order of magnitude. The shaded area 
corresponds to the eigenvalues predicted for randomized data. 

The first largest eigenvalue is still associated with a market mode, but the second largest eigenvalue, more 
detached now from the others than in the non-lagged case, means something else. Figure 13 shows the values 
of the indices in the new eigenvalue associated with the second largest eigenvalue. Once more, white bars 
correspond to positive signs in the eigenvector, and gray bars correspond to negative signs in the eigenvector. 
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Fig. 13. Contributions of the stock market indices to eigenvector e%, corresponding to the second largest 
eigenvalue of the lagged correlation matrix. White bars indicate positive values, and gray bars indicate negative 
values, corresponding to the first and second semesters of 2001. The indices are aligned in the following way: 
S&P, Nasd, Cana, Mexi, Pana, CoRi, Berm, Jama, Braz, Arge, Chil, Colo, Vene, Peru, UK, Irel, Fran, Germ, 
Swit, Autr, Ital, Malt, Belg, Neth, Luxe, Swed, Denm, Finl, Norw, Icel, Spai, Port, Gree, CzRe, Slok, Hung, 
Serb, Croa, Slov, BoHe, Mont, Mace, Pola, Roma, Bulg, Esto, Latv, Lith, Ukra, Russ, Kaza, Turk, Cypr, 
Isra, Pale, Leba, Jord, SaAr, Kuwa, Bahr, Qata, UAE, Ohma, Paki, Indi, SrLa, Bang, Japa, HoKo, Chin, 
Mong, Taiw, SoKo, Thai, Viet, Mala, Sing, Indo, Phil, Aust, NeZe, Moro, Tuni, Egyp, Ghan, Nige, Keny, 
Tanz, Nami, Bots, SoAf, and Maur. 

Eigenvector e2 detects the structure of the European block, which consists on indices from countries from 
Central Europe, and also of indices from Hungary, Poland, Russia, Turkey, Israel, and South Africa, which 
basically are part of the European cluster. Also appearing with negative but diminute values in eigenvector e<i 
are the indices from Panama, Chile, Colombia, Peru, and Mongolia, most of them with marginal connections 
in the correlation network. 

Now, lets analyze the network formed using lagged data. Figure 14 shows the resulting asset graph for 
lagged data at threshold T = 0.5 using Spearman correlation in comparison to the netwrok based on the 
original (non-lagged) data. What one can readily see is that the new coordinates for the lagged indices are 
more stretched, with the American cluster placed in between the European and the Pacific Asian clusters, and 
with some of the outlier indices from Europe placed farther than the Pacific Asian ones. The connections, 
nevertheless, are practically the same, so there are nearly no changes in the way indices connect, even when 
lagging some of them. This confirms the robustness of the asset graphs that have been built here so far. 




IndoHoKo 




Fig. 14. Three dimensional view of the asset graphs for the whole period of data (2007 to 2010). The figure 
on the left is based on the non-lagged data, and the figure on the right, on lagged data, both with threshold 
T = 0.5. 



2.11 Overview 

Let us now draw some conclusions from the many graphs seen in this section. First of all, we saw that there are 
two main clusters that maintain their cohesiveness through time and through most leves of threshold, which are 
the American and the European clusters. The American cluster is basically comprised of the indices S&P 500 
and Nasdaq, which are soon joined by Canada, and then by some Latin American indices. The European cluster 
is denser, and comprised of a core of Central European indices, then joined by Eastern European indices. We 
can also identify two other persistent connections: one between South Africa and Namibia, and other between 
Greece and Cyprus. These are not surprising, for Namibia, whose economy is in early stages of development, 
is highly dependent both economically and politically on South Africa. Now Cyprus is divided between Greek 
and Turkish populations, and the Stock Exchange to which the index from Cyprus is assigned is in Nicosia, 
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in the Greek part of the island. We also have a similar connection, at higher thresholds, between Estonia and 
Lithuania, both former members of the extinct Soviet Union. 

The pair South Africa-Namibia is firmly connected with the European cluster, and Israel also has a mildly 
strong connection with the same cluster. At higher thresholds, a Pacific Asian cluster unfolds, with Australia 
attached to it, and at still higher thresholds, close to the noise limit, we can see an Arab cluster forming. We 
don't see the formation of an African cluster until deep inside the thresholds already in the region dominated 
by noise. The American and European clusters do not form connections until a certain threshold, and Pacific 
Asia makes connections with Europe at slightly higher values of the threshold. So, what we see are three 
major clusters, based on geographical and sometimes cultural ties, which maintain some indepedence, with the 
addition of more indices as the threshold values grow. From the data concerning the second semester of 2008, 
we can also see that networks shrink in times of high volatility, which means there is higher correlation between 
the indices, as already stated in [15] . All connections are stable both in what concerns the choice of correlation 
measure and the lagging of some of the indices so as to acommodate differences in operating hours of the stock 
exchanges. 

3 Survivability of connections and clusters 

I now shall turn my attention to the survivability of connections between indices, and specifically of clusters of 
indices, through time. By survivability I mean the endurance of connections in time given a certain threshold. 
The survival of connections is heavily dependant on the threshold value chosen for the network. For T = 1.5, 
as an example, all connections survive 100% of the time, for all indices are fully connected at this threshold. 
Now, for lower threshold values, survivability becomes scarce. 

The timeline that is being studied is varied in terms of volatility and correlation of indices. Figure 15a 
shows a graph of the average volatility of the MSCI (Morgan and Stanley Capital International) World Index, 
based on a portfolio of indices from a variety of developed economies worldwide. The average volatility of this 
index is calculated by taking the standard deviations of the log-returns of the index for running windows of 
100 days each with the windows shifted by one day at a time. Plotted in figure 15b is the average correlation 
of the 92 indices I am using in this research from correlation matrices calculated in overlapping windows of 100 
days of the complete time series from 2007 to 2010. Since I am considering windows of 100 days, I assign the 
result of each window to the last day of that window. So, the curves in both graphs start 100 working days 
after the beginning of 2007 (05/22/2007). 

What may be clearly noticed is that high volatility is associated with high correlation of financial markets 
(as stated in [15J and references therein), which means that markets tend to behave similarly in times of crisis. 

, Correlation 



day 

01/2008 01/2009 01/2010 01/2011 01/2008 01/2009 01/2010 01/2011 

Fig. 15. a) Average volatility of the MSCI World Index in time, b) Average correlation of the 92 indices in 
time. 
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3.1 Single step survival ratio 

Measuring survivability of connections in a network is not a difficult task, but there are many ways to do 
that and they are not all equivalent. The simplest measure is to consider the single step survival ratio [3], here 
adapted to asset graphs, defined as the fraction of edges (connections) common to two consecutive asset graphs, 
divided by the number of edges of the asset graph. This can be easily calculated by defining the adjacency 
matrix A |16| . which is a matrix whose elements are one if there is a connection between indices i and j 
and zero otherwise. In the present case, an asset graph may be represented as the distance matrix between all 
indices, but limited by a threshold value T. So, the adjacency matrix associated with this distance matrix with 
a cut-off has ones for distances dij < T and zeros for dij > T. Now, it is fairly easy to define the single step 
survival ratio: it is the sum of all elements of the product of an adjacency matrix at time t with an adjacency 
matrix at time t + 1, divided by the number of edges at time t: 



1 - 



(3) 



where Aij{t) is the element ij of matrix A(t), Aij(t + 1) is the element ij of matrix A(t + 1), and ./V is the 
total number of nodes in the correlation matrix (this is the same as the scalar product of both matrices divided 
by M). Since I am using a distance threshold and not the number of edges to establish an asset graph, the 
number of edges is not constant in time. So, I am using M as the number of connections in the asset graph 
related with A(t). 

The results depend on the threshold T, and Figure 16 shows the results for thresholds going from T = 0.1 
to T = 0.5 and T = 0.6, just below and above the noise threshold. For high threshold values up to T = 1.5, 
S(t) assumes values that increasingly lead to S(t) = 1 for all values of t, for T = 1.5 is the distance at which 
all nodes are connected and so all connections survive all the time. 
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Fig. 16. Single step survival ratio for various asset graphs based on a diversity of thresholds for running 
windows of 100 days with steps of one day at a time. 

As expected, the single step survival ratio varies much for T = 0.1 and T = 0.2, for one has, for these 
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thresholds, very few connections, although strong ones. As the value of the threshold grows, one has many 
more connections, and then it is expected that the single step survival ratio goes slowly towards the unit value. 
If one considers T = 0.5, which borders the noise limit, one then has strong variations of the single step survival 
ratio at 10/22/2009, 11/03/2009, 04/29/2010, 10/22/2010, and 12/20/2010, which do not coincide with any 
particular important event. When analyzing this, one should bear in mind that the single step survival ratio is 
being calculated over windows of 100 days, and so it is not very sensitive when pinpointing events that happen 
in one or few days. 

If one now wished to compare the single step survival ratio of real data with the one of randomized data, 
the first striking difference between them is that, for randomized data, there are virtually no connections bellow 
T = 0.7. So, one can only compare the results for connections above this distance value. Figure 17 shows the 
comparison of the single step survival ratio for real (graph on the left) and randomized (graph on the right) 
data for a typical simulation where all time series were randomly shuffled. One thing to notice is that the 
single step survival ratio for randomized data oscillates around 0.87. The other is that it doesn't come close to 
the value 1, so that the single step survival ratio of randomized data is never large, and it is also much more 
volatile (high standard deviation). The fact that the survival ratio is so high for randomized data is that it is 
an effect of my use of overlapping windows of 100 days with shifts of one day at a time. This also applies to 
the real data, but does not explain by itself the high survival ratio in that case. 
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Fig. 17. Single step survival ratio for T = 0.8 for running windows of 100 days with steps of one day at a time. 
The graph on the left is based on real data, and the graph on the right is based on randomized data. 



3.2 Multi step survival ratio 

A natural generalization of the single step survival ratio is the multi step survival ratio [3j, which is the 
percentage of edges that exist in a certain number of windows, divided by the total number of edges. Here I 
shall adapt this definition to the case where asset graphs are built based on threshold values for the distance, 
in which case the number of edges may vary in each window. The definition here of a multi step survival ratio 
for p windows is 

1 N 

S p (*) = f E Aij$)Aii(t + !)••■ Mt+P) > ( 4 ) 

where Aij(t) is the element ij of matrix A(t), N is the total number of nodes in the correlation matrix, and M 
is the total number of edges in time t. 

It is expected that the multi step survival ratio drops as the number of windows p grow, for it is increasingly 
difficult for a connection (edge) to survive at all intervals being considered. It is also expected that for a large 
enough value of the thresholds, where all nodes are connected to all others, this survival ratio will always have 
the value 1. Figure 18 (left graph) shows the multi step survival ratios for thresholds ranging from T = 0.1 to 
T = 1.5 (fully connected network) as functions of the number of windows. 

Note the high concentration from thresholds T = 0.5 to T = 0.8 (in red), and also with T = 1.1. It is at 
these values that one has the most information about connections yet with a bearable amount of noise. For 
values bellow T = 0.5, there are two few connections, although strong ones, which makes it easier for the multi 
step survival ratio to drop faster, and for very high thresholds, there are so many connections that it is easy 
for a particular edge to survive all the time. 

The graph on the left of Figure 18 shows the multi step survival ratio for a typical simulation with ran- 
domized data, obtained by shufling randomly the time series of the indices. One difference is that there are no 
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asset graphs for distance thresholds bellow T = 0.7, for there are no connections formed bellow this threshold 
for randomized data (only very rarely). Other difference is that the multi step survival ratio for randomized 
data decays nearly exponentially, as it was to be expected from data that should not be related. 

It is also interesting to compare this multi step survival ratio with the one obtained from a minimum 
spanning tree [IU] |13j . Since edges built using a minimum spanning tree algorithm are much less stable than 
for asset graphs, it is much easier for those connections to change over time, leading to a near exponential 
decay of the multi step survival ratio, which does not occur for connections based on asset graphs. 
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Fig. 18. Left: multi step survival ratios for thresholds ranging from T = 0.1 to T = 1.5 as functions of the 
number of windows; the curves related with thresholds from T = 0.5 to T = 0.8 are in red. Right: the same 
for randomized data. 



3.3 Survivability 

The multi step survival ratio depends on the starting point of the data being considered and also on the time 
shift between windows. So, it is a measure that, like the single step survival ratio, depends on time. An 
alternative measure derived in [13] considers each day as a starting point and then tests the multi step survival 
for a number D of days (5, for example) of an asset graph built with a threshold T. Then the percentage of 
tests in which a particular edge survives is used in order to define a survivability measure. As an example, 
the connection between nodes i and j for five consecutive days is tested with many different starting points in 
running windows of length At. If the connection survives in 30% of the tests, then its survivability is s = 0.3. 
So, survivability is a global (in terms of time) measure of how stable a connection between two nodes is through 
a certain period of time. Dividing the number of surviving edges by the total number of edges of the asset 
graph, one obtains the survivability ratio, S r , that goes from to 1. 

Performing an analysis of survivability for the data concerning the 92 stock market indices from 2007 to 
2010, one obtains the results in Figure 19. In this figure, the survivability ratio is plotted against the percentage 
of tests in which that number of connections survived. 

Once again, there is a concentration of curves from T = 0.5 to T = 0.8, which is the region around the noise 
limit. For these threshold values, the percentage of connections that survive 100% of the time is around 44% 
which, compared with the survivability of connections in a Minimum Spanning Tree [13] is, not surprisingly, 
very high. For T = 0.1, there is just one connection (from the original 2) that survive 100% of the time: the 
one between France and Germany. This number grows and, by T = 0.5, the connections between S&P and 
Nasdaq, between the UK, Ireland, France, Germany, Switzerland, Austria, Italy, Belgium, the Netherlands, 
Sweden, Denmark, Finland, Norway, Spain, Portugal, and Poland, between Greece and Cyprus, and beween 
Japan, Hong Kong, Singapore, and Australia survive 100% of the time. So, again one can detect a small 
American cluster, a large European cluster, and a Pacific Asian one (including Australia). The number grows 
for higher threshold values until every index is connected to every other index, which happens by T = 1.5, and 
all connections survive 100% of the time. 

Taking T = 0.7, as another example or 100% survivability, one has an American cluster, including indices 
from North and South America, a large European cluster (which includes Israel and South Africa), a small 
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Pacific Asian cluster, the pairs Greece - Cyprus and Namibia - South Africa, and connections between the 
North American cluster with the other two. By fixing the threshold value and varying the survivability ratio, 
one may obtain a sequence of networks that grow as the survivability ratio drops. This procedure may be used 
in order to build "survivability networks", which may bring some information on the stability of connections. 




Fig. 19. Survivability ratios for thresholds ranging from T = 0.1 to T = 1.5 as functions of the percentage of 
surviving windows. On the left, it is plotted the survivability for the network obtained from the stock market 
indices; on the right, the survivability for a network built with randomized data. 

Also in Figure 19, to the right, I plot the survivability ratio for a network obtained by randomly shuffling 
the time series of the 92 indices being studied. The curves are results of single simulations, for the computations 
are quite long, but one does not lose too much information here by doing this. From T = 0.1 to T = 0.7, 
there are no connections in the network and so one cannot even define a survivability ratio. From T = 0.8 to 
T = 1.1, the survivability curves drop exponentially, and for values higher than T = 1.2, it quickly approaches 
the line S = 1, for all connections are active all the time for large enough values of the threshold. The difference 
between the results obtained with randomized data and with real data are striking: survivability is often much 
higher for real data, showing there must be information on the connections formed by their correlations. 

Figure 20 (graph on the left) analyzes the relationship between distance and survivability for T = 0.5. The 
horizontal axis is the distance between nodes as calculated by ([2]), and the vertical axis is the percentage of 
time a connection survives. What one can readily see is that, like in the case of Mininum Spanning Trees, there 
is no linear relation between the two variables. Looking at the data, one could see that around 93% of the 
connections have survivability zero, specially those that represent distances higher than 0.4. Also, around 3% 
of the connections have survivability 1 (100% ), none of them with a distance higher than 0.45. There is also 
a lump of connections that survive between 30% and 60% of the time with distances varying from 0.2 to 0.4 
(around 3% of the total number of connections). 




Fig. 20. Left: scatter plot of the distance d between nodes and the percentage of time a connection survives, 
s, for T = 0.5. Right: scatter plot of the distance d between nodes and the percentage of time a connection 
survives, s, for T = 0.8 and randomized data. 
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The graph on the right in Figure 20 shows the relationship between distance and survavibility ratio based 
on randomized data and a threshold T = 0.8 (for thresholds bellow T = 0.6 have no connections for randomized 
data). The difference is clear, as distances concentrate around a certain value and connections tend to survive 



So, concluding this section, we saw that there are remarkable differences between real data and randomized 
data in terms of survivability, be it measured by the single step survival ratio, the multi step survival ratio 
or the survivability ratio. For real data, the curves for thresholds ranging from T = 0.5 to T = 0.8 tend 
to agglomerate, being that the region where one can get the most information about the network with an 
acceptable level of noise. 

We could also see that some connections, and by extension, some clusters remain constant in time. Those 
clusters are heavily associated with geoographic position and on cultural and ethnical ties. Those connections 
that were more intense also tended to last longer, although there are many exceptions to this rule. Next, the 
effects of the choice of threshold and of time in centrality measures of network theory are studied. 

4 Centrality Measures 

Centrality measures are of key importance in network theory. Whereas a node has many connections, or it is 
on the shortest path between many pairs of nodes, or if it lies within a heavily connected region, are of great 
importance for models of propagation of information, of diseases, and on the vulnerability of a network to 
attacks, as examples. So, many measures of how central a node is in a network were developed, and each one 
associates a different concept to the word "centrality". Here, we shall see how one of those measures change 
over time, and what is the influence of the choice of threshold when building an asset graph on those centrality 
measures. 

4.1 Node degree 

The node degree of a node is the number of connections it has in the network. Most of the stocks have low 
node degree, and some of them, called hubs, have a large node degree and are generally nodes that are more 
important in the dynamics of a network. Considering the adjacency matrix A of a network, the node degree of 
node can be defined mathematically as 



Node degrees vary considerably according to the threshold value of an asset graph. It is expected that, 
for small threshold values, where very few connections are formed, most node degrees are zero, and, for high 
threshold values, where all connections are formed, all nodes will have the highest node degree that is possible. 
As the maximum node degree vary from 1 to 92 as thresholds grow, I divided all node degrees by the maximum 
node degree for better comparison of the asset graphs. 

Considering that the asset graphs are formed only by those nodes which are connected to any other node, 
then the asset graphs for low threshold values are much smaller. Taking this into account, one must eliminate 
from the network all nodes which have node degree zero. Figure 21 illustrates frequency distributions of the 
normalized node degrees for thresholds going from T = 0.1 to T = 1.1, and for randomized data for T = 1.0. 

Extreme values for the threshold lead to distributions that are heavily concentrated. Again, the distributions 
close to the noise threshold seem to be the ones that give the most information. For thresholds down to T = 0.7, 
indices alternate among the top highest node degrees in an apparently random way. From T = 0.6 down to 
T = 0.1, Central European indices occupy the top positions, particularly France, the UK, the Netherlands, and 
Germany. 
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Fig. 21. Histograms of the normalized node degrees of asset graphs based on thresholds varying from T 
to T = 1.1, plus histogram for randomized data at T = 1.0. 
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From Figure 21, one can also see that the resulting histograms are definitely not exponentially decreasing, 
so they cannot be represented by power laws of the type = ck~ a , where pk is the frequency distribution 
for the value k, and c and a are constants. Power laws are the hallmark of a diversity of complex systems, 
as in the study of earthquakes, the world wide web, networks of scientific citations, of film actors, of social 
interactions, protein interactions, and many other topics [16J. Networks whose centrality measures follow this 
type of distribution are often called scale-free networks, and this behavior can best be visualized if one plots 
a graph of the cumulative frequency distribution of a centrality in terms of the centrality values, both in 
logarithmic representation [16J . Figure 22 shows the logarithm of the cumulative frequency distributions as 
functions of the logarithm from T = 0.1toT = l.l (graph on the left), and for randomized data (graph on the 
right), from T = 0.9 to T = 1.2 (for T = 0.9, the curve is on the vertical axis). 

So, although the cumulative frequency distribution of the node degrees for the asset graphs is not the one 
of a scale-free network, they are definitely not the same as the cumulative frequency distribution of randomized 
data. A cautionary note must be added here. As the networks that are being considered in this analysis are 
very small compared with scale-free networks like the internet, or pure scale-free networks, which must be 
infinite in size. So, conclusions about these asset graphs networks, particularly the ones with smaller thresholds 
and a very small number of nodes, are not reliable. 
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Fig. 22. Cumulative frequency distribution of the node degree in terms of the centrality values, both in 
logarithmic representation. 

In Figure 23, I show the evolution of the average node degree as a function of time, Nd(t), which is the 
mean of the node degrees of the nodes belonging to each asset graph (the number of nodes may change in 
time). In the calculation of the graph, I have used windows of 100 days, shifted one day at a time. It does 
make data overlap, but gives a better number of points for the plot. The curves are, as expected, very similar 
to the graph of the average correlation in time and, as a consequence, to the graph of the volatility of the world 
market as measured by the MS CI World Index. The aggreeement between the average node degree and the 
average correlation time series may be measured by their Pearson correlation, which goes from 0.38 for T = 0.1 
to 0.95 for T = 0.8. Figure 24 shows the time series of the average node degree for T = 0.7 in terms of the 
average correlation, < C > (t), time series. 
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Fig. 23. Evolution of the average node 
degree, Nd(t), in time. 
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Fig. 24. Average node degree Nd(t) for T 
times average correlation < C > (t). 
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4.2 Node strength 

The strength of a node is the sum of the correlations of the node with all other nodes to which it is connected. 
If C is the matrix that stores the correlations between n nodes that are linked in the asset graph, then the 
node strength is given by 



Ns J 



i=i 



(6) 



where CV,- is an element of matrix C. 

For the asset graphs of the world stock indices, the Central European indices appear as the most central 
according to node strength for all threshold values. The topmost indices are those from the Netherlands, 
France, Austria, Belgium, Austria, the UK, Germany, and Italy. 
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Figure 25 displays the cumulative frequency distribution of the node strength in terms of the centrality 
values, both in logarithmic representation. The figure on the left displays the curves for real data (from 
T = 0.1 to T = 1.2), and the figure on the right represents the curves for randomized data (from T = 0.9 
to T = 1.2). Again, the results are remarkably different from randomized data, but they do not seem to 
correspond to scale- free networks. 
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Fig. 25. Cumulative frequency distribution of the node strength in terms of the centrality values, both in 
logarithmic representation. The figure on the left displays the curves for real data, and the figure on the right 
represents the curves for randomized data. 



4.3 Eigenvector centrality 

One node may have a low dregree, but it may be connected with other nodes with very high degrees, so it is, in 
some way, influent. A measure that takes into account the degree of neighbouring nodes when calculating the 
importance of a node is called eigenvector centrality. If one considers the eigenvectors of the adjacency matrix 
of the distance matrix for the asset graph, and choosing its largest value, one may then define the eigenvector 
with largest eigenvalue by the equation 

AX = XX , (7) 

where X is the eigenvector with the largest eigenvector A. The eigenvector centrality of a node i is then defined 
as the ith element of eigenvalue X: 

Ed = xt , (8) 

where Xi is the element of X in row i. 

Here, there is a clearer set of nodes which exhibit more eigenvector centrality values than the others in a 
consistent way for the thresholds closest to the limit of the noise region. The UK, France, the Netherlands, 
Germany, Austria, and Denmark appear as the top nodes from T = 0.1 up to T = 0.6, which is not surprising, 
since they inhabit a densely connected region of the asset graphs. 

Figure 26 displays the cumulative frequency distribution of the eigenvector centrality in terms of the cen- 
trality values, both in logarithmic representation. The figure on the left displays the curves for real data, and 
the figure on the right represents the curves for randomized data. All curves are quite close to one anotherm 
and the curves for extreme values, T = 1.0 and T = 1.1, are very similar to the ones obtained for randomized 
data. 



23 



log(C'um. distr.) 



\og(Cum. distr.) 




log(Ec) 



Fig. 26. Cumulative frequency distribution of the eigenvector centrality in terms of the centrality values, both 
in logarithmic representation. The figure on the left displays the curves for real data, and the figure on the 
right represents the curves for randomized data. 



4.4 Betweenness 

The betweenness centrality measures how often a node lies on the paths between other vertices. It is an 
important measure of how much a node is important as an intermediate between other nodes. It may be 
defined as 



Bc k 



II; 



(9) 



where is the number of shortest paths (geodesic paths) between nodes i and j that pass through node k 
and rriij is the total number of shortest paths between nodes i and j. Our network is fully connected, so we 
need not worry about niij being zero. 

Here, one does not have a particular set of nodes that present high betweenness for most threshold values. 
One may highlight Singapore, which appears among the top four from T = 0.5 to T = 0.8, and the Czech 
Republic, which is among the top four if T = 0.5 or T = 0.6. 

Figure 27 displays the cumulative frequency distribution of the betweenness centrality in terms of the 
centrality values, both in logarithmic representation. Like before, the figure on the left displays the curves 
for real data, and the figure on the right represents the curves for randomized data. The curves for real and 
randomized data are almost exactly the same for T = 1.0 and T = 1.1, which are well inside the random noise 
region. The curves for thresholds ranging from T = 0.4 to T = 0.9 get entangled and are quite distinct from 
the other curves. 
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Fig. 27. Cumulative frequency distribution of the betweenness centrality in terms of the centrality values, both 
in logarithmic representation. The figure on the left displays the curves for real data, and the figure on the 
right represents the curves for randomized data. 
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4.5 Harmonic closeness 



Another measure of centrality is the closeness centrality, which measures the average distance between one 
node and all the others. It is defined as the measure of the mean geodesic distance for a given node i, which is 
given by 



Edij , 



(10) 



where n is the number of vertices and dij is a geodesic (minimum path) distance from node i to node j. This 
measure is small for highly connected vertices and large for distant or poorly connected ones. In order to obtain 
a measure that is large for highly connected nodes and small for poorly connected ones, one then defines the 
harmonic closeness centrality of node i as 



He 1 



1 



1 



n 



1 



n 

£ T 



(ii) 



There is again a set of nodes which consistently have the highest harmonic closeness centralities for assert 
graphs close to the noise threshold. These are Central European indices, like the UK, France, Germany, the 
Netherlands, Italy, and with an Eastern European index, from the Czech Republic. 

Figure 28 displays the cumulative frequency distribution of the harmonic closeness centrality in terms of 
the centrality values, both in logarithmic representation. Once more, the figure on the left displays the curves 
for real data, and the figure on the right represents the curves for randomized data. Again, only the curves for 
thresholds T = 1.0 and T = 1.1 are similar to the ones obtained based on randomized data. 
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Fig. 28. Cumulative frequency distribution of the harmonic closeness centrality in terms of the centrality 
values, both in logarithmic representation. The figure on the left displays the curves for real data, and the 
figure on the right represents the curves for randomized data. 



4.6 Overview 

Our analysis of the centrality measures showed that there is a strong dependence of centralities on the threshold 
chosen to define a particular asset graph, and that, although the probability frequency distributions and the 
cummulative frequency distributions of the four centralities we have researched do not resemble those of scale- 
free networks, they are remarkably different from their counterparts based on randomized data, except for 
high values of the threshold. The values of the threshold between T = 0.5 and T = 0.8, close to the noise 
threshold, are believed to be the ones that generate asset graphs with more information also in terms of 
centralities. Central European indices, like the UK, France, the Netherlands, Germany, Austria, and Italy, 
carry high centrality values, in general, and Singapore and the Czech Republic presente high betweenness 
centrality, Singapore because it connects most Pacifi Asian indices with some European ones, and the Czech 
Republic because it somehow connects Eastern and Central European indices. The final figure of this article 
(Figure 29) shows the five centrality measures discussed in this section as functions of indices. 
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Fig. 29. Centrality measures for asset graphs denned by T = 0.7, T = 0, 6, and T = 0.5 in terms of node. The 
horizontal lines indicate divisions between regions: NoAm for North America, SoAm for South America, CeEu 
for Central Europe, EaEu for Eastern Europe, CeAs for Central Asia, Pa As for Pacific Asia, Oc for Oceania, 
and Af for Africa. 

From the figure, one can see that Central Europe and part of Eastern Europe present a consistent large 
number of centralities, expect for betweenness. North America and part of the Pacific Asia also behave, in a 
lesse degree, in the same way. For betweenness centrality, we have peaks in the Czech Republic and Singapore, 
and their centralities were explained in the previous paragraph. The eigenvector centrality and betweenness 
centrality are particularly useful in determining high centrality, in its two main aspects, nodes that are well 
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connected in regions of densely connected nodes, and nodes that link smaller clusters of nodes. 

5 Conclusion 

The aims of this article may be divided into three parts: the first one was to use asset graphs based on thresholds 
for a distance measure obtained from the correlation matrices of time series of world stock market indices in 
order to study their behavior due to changes in the threshold value and also their evolution in time; the second 
was to analyze the survivability of the connections between indices in asset graphs and, as a consequence, of 
clusters based on regional, economical, and cultural ties, and their dependence on the threshold values; the 
third was to analyze some centrality measures of the nodes in the previous asset graphs, and their dependence 
on threshold values. 

What we have shown is that financial markets tend to group themselves according to geographical region, 
and also according to economic and cultural ties. Two main clusters were detected, an American one, and 
an Eurpean one, both existing already at low threshold values (small distances) and very stable in time (high 
survivability). Other clusters form at higher levels of the threshold, like a Pacific Asian cluster and, at higher 
threshold values, an Arab one. All clusters tend to integrate at higher levels of the threshold, beginning by an 
absorption of some Eastern European nodes, plus Israel and South Africa, by the European cluster, and an 
enlargement of the North American cluster by the addition of South American nodes. An unification of the 
American, European, and Pacific Asian clusters occurs at intermediate values of the threshold. We could also 
see that, in times of crisis, like 2008, those asset graphs tend to shrink in size and augment in number of nodes, 
both consequences of the higher correlations between indices. These results are robust with respect to both the 
choice of correlation coefficient (Spearman or Pearson, specifically), and the lagging of some of the indices in 
order to take into account the difference in operating hours. We also used simulations with randomized data 
in order to establish threshold values above which random noise has a strong influence in the results. 

Another result was that the second highest eigenvalue of the correlation matrix for the time series of stock 
exchange indices seems to be connected with the difference in operating hours, a result already obtained in 
PQ, separating the indices in two groups, Western and Eastern ones. By lagging the second group of indices in 
one day, we then obtain a new structure for the second eigenvalue, separating the European cluster from the 
others. 

We could also see that Central Europe exhibits high values of centrality according to four centrality measures, 
two of them (node degree and node strenght) based on the number and strenght of connections of a node, one 
of them (eigenvector centrality) which also takes into account how connected each nodes' neighbors are, and 
one (harmonic closeness centrality) measuring the average (inverse) distance of a node to all the others. We 
also saw that the Czech Republic and Singapore appear as highly central with respect to a fifth measure, 
betweeness centrality, which measures how often a node is in between shortest paths connecting other nodes. 
We also made a study of the evolution of the average node degree in time and showed it is highly correlated 
with the average correlation of nodes in time. 

All results showed there is a lot of information that can be obtained from asset graphs based on distance 
thresholds, and that the most significant ones seem to be those around the noise threshold. Most results have 
been compared with the ones obtained by simulations with randomized data, and they are remarkably different 
from those, except for high values of the threshold, which leads to the belief that the connections between 
international stock exchange indices are not formed at random, but have a complex structure underlying their 
behavior. The stability of some of the main connections also reinforce the idea that those ties are not formed 
at random. Finally, asset graphs do not exhibit characteristics of scale-free networks when analyzed in their 
centralities, although they are still quite different from random ones in terms of these same centrality measures. 

Acknowledgements 

The author acknowledges the support of this work by a grant from Insper, Instituto de Ensino e Pesquisa. 
This article was written using I^TeX, all figures were made using PSTricks, and the calculations were made 
using Matlab and Excel. All data are freely available upon request on leonidassj@insper.edu.br. 



27 



A Stock Market Indices 



The next table (table 2) shows the stock market indices we used, their original countries, the symbols we 
used for them in the main text, and their codes in Bloomberg. In the tables, we use "SX" as short for "Stock 
Exchange". Some of the indices changed names and/or method of calculation and are designated by the two 
names, prior to and after the changing date. 



Index 


Country 


Symbol 


Code in Bloomberg 


North America 




f~1 I ~\ f r\r\ 

S&P 500 


United States of America 


S&P 


SPX 


AT 1 /~1 • j 

JNasdaq Composite 


United States of America 


Nasd 


CCMP 


S&P/TSX Composite 


Canada 


Cana 


SPTSX 


IPC 


Mexico 


Mexi 


MEXBOL 


Central America 




Bolsa de Panama General 


Panama 


Pana 


BVPSBVPS 


BCT Corp Costa Rica 


Costa Rica 


CoRi 


CRSMBCT 


Caribbean 




Jamaica SX Market Index 


Jamaica 


Jama 


JMSMX 


British overseas territories 




Bermuda SX Index 


Bermuda 


Berm 


BSX 


South America 




Ibovespa 


Brazil 


Braz 


IBOV 


Merval 


Argentina 


Arge 


MERVAL 


IPSA 


Chile 


Chil 


IPSA 


IGBC 


Colombia 


Colo 


IGBC 


IBC 


Venezuela 


Vene 


IBVC 


IGBVL 


Peru 


Peru 


IGBVL 


Western and Central Europe 




FTSE 100 


United Kingdom 


UK 


UKX 


ISEQ 


Ireland 


Irel 


ISEQ 


CAC 40 


France 


Fran 


CAC 


DAX 


Germany 


Germ 


DAX 


SMI 


Switzerland 


Swit 


SMI 


ATX 


Austria 


Autr 


ATX 


FTSE MIB or MIB-30 


Italy 


Ital 


FTSEMIB 


Malta SX Index 


Malta 


Malt 


MALTEX 


BEL 20 


Belgium 


Belg 


BEL20 


AEX 


Netherlands 


Neth 


AEX 


Luxembourg LuxX 


Luxembourg 


Luxe 


LUXXX 


OMX Stockholm 30 


Sweden 


Swed 


OMX 


OMX Copenhagen 20 


Denmark 


Denm 


KFX 


OMX Helsinki 


Finland 


Finl 


HEX 


OBX 


Norway 


Norw 


OBX 


OMX Iceland All-Share Index 


Iceland 


Icel 


ICEXI 


IBEX 35 


Spain 


Spai 


IBEX 


PSI 20 


Portugal 


Port 


PSI20 


Athens SX General Index 


Greece 


Gree 


ASE 
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Index 


Country 


Symbol 


Code in Bloomberg 


Eastern Europe 




PX or PX50 


(lyprh T?p"nnt>lip 

V — /. V V 1 1 J. \j\Zr yj IA U 


CzRe 


PX 


SAX 


J LKJ V < 1 lY 1 < 1 


Slok 


SKSM 

UlVkJ ±V_L 


Budapest SX Index 




Hung 


BUX 


BELEX 15 


V_J^l Uld 


Serb 


BELEX 15 


CROBEX 




Croci 


ORO 


SBI TOP 




Slov 


SBITOP 


SASE 10 


Rri^ma Fmn nPT"7PO"nviTi9 


BoHe 


SASX10 


MOSTE 


A/lnTifpn pffvn 


Mnnt 


MOSTE 


MBI 10 


A/lp ppn r\v\ l a 


Marp 


MBI 


WIG 


Poland 


Pola 


WIG 


BET 


Romania 


Roma 


BET 


SOFIX 


Bulgaria 


Bulg 


SOFIX 


OMXT 


Estonia 


Esto 


TALSE 


OMXR 


Latvia 


Latv 


RIGSE 


OMXV 


Lithuania 


Lith 


VILSE 


PFTS 


Ukraine 


Ukra 


PFTS 


Eurasia 




MICEX 


Russia 


Russ 


INDEXCF 


ISE National 100 


Turkey 


Turk 


XU100 


Western and Central Asia 




KASE 


Kazakhstan 


Kaza 


KZKAK 


CSE 


Cyprus 


Cypr 


CYSMMAPA 


Tel Aviv 25 


Israel 


Isra 


TA-25 


Al Quds 


Palestine 


Pale 


PASISI 


BLOM 


Lebanon 


Leba 


BLOM 


ASE General Index 


Jordan 


Jord 


JOSMGNFF 


TASI 


Saudi Arabia 


SaAr 


SASEIDX 


Kuwait SE Weighted Index 


Kuwait 


Kuwa 


SECTMIND 


Bahrain All Share Index 


Bahrain 


Bahr 


BHSEASI 


QE or DSM 20 


Qatar 




DSM 


ADX General Index 


United Arab Emirates 


UAE 


ADSMI 


MSM 30 


Ohman 


Ohma 


MSM30 


South Asia 




Karachi 100 


Pakistan 


Paki 


KSE100 


SENSEX 30 


India 


Indi 


SENSEX 


Colombo All-Share Index 


Sri Lanka 


SrLa 


CSEALL 


DSE General Index 


Bangladesh 


Bang 


DHAKA 
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Index 


Country 


Symbol 


Code in Bloomberg 


Asia-Pacific 




Nikkei 25 




Tana 


NKY 


Hang Seng 


rTon c KnTiff 


HoKo 


HST 


Shangai SE Composite 


China 


Chin 


SHCOMP 


MSE TOP 20 


1VT on on! i a 


A/Ton 


MSETOP 


TAIEX 


TVi i wn n 

1 CAil VV CXiLL 


Tsiw 


TWSE 


KOSPI 


South Korea 


SoKo 


KOSPI 


SET 


Thailand 


Thai 


SET 


VN-Index 


Vietnam 


Viet 


VNINDEX 


KLCI 


Malaysia 


Mala 


FBMKLCI 


Straits Times 


Singapore 


Sing 


FSSTI 


Jakarta Composite Index 


Indonesia 


Indo 


JCI 


PSEi 


Philippines 


Phil 


PCOMP 


Oceania 




S&P/ASX 200 


Australia 


Aust 


AS51 


NZX 50 


New Zealand 


NeZe 


NZSE50FG 


Northern Africa 




CFG 25 


Morocco 


Moro 


MCSINDEX 


TUNINDEX 


Tunisia 


Tuni 


TUSISE 


EGX 30 


Egypt 


Egyp 


CASE 


Central and Southern Africa 




Ghana All Share Index 


Ghana 


Ghan 


GGSEGSE 


Nigeria SX All Share Index 


Nigeria 


Nige 


NGSEINDX 


NSE 20 


Kenya 


Keny 


KNSMIDX 


DSEI 


Tanzania 


Tanz 


DARSDSEI 


FTSE/Namibia Overall 


Namibia 


Nami 


FTN098 


Gaborone 


Botswana 


Bots 


BGSMDC 


FTSE/JSE Africa All Share 


South Africa 


SoAf 


JALSH 


SEMDEX 


Mauritius 


Maur 


SEMDEX 



Table 2: names, codes, and abbreviations of the stock market indices used in this article. 
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